Xuchao Zhang

Memora: A Harmonic Memory Representation Balancing Abstraction and Specificity

Feb 03, 2026

Your Self-Play Algorithm is Secretly an Adversarial Imitator: Understanding LLM Self-Play through the Lens of Imitation Learning

Feb 01, 2026

Adapting Web Agents with Synthetic Supervision

Nov 08, 2025

LEGOMem: Modular Procedural Memory for Multi-agent LLM Systems for Workflow Automation

Oct 06, 2025

Enhancing Reasoning Capabilities of Small Language Models with Blueprints and Prompt Template Search

Jun 10, 2025

Anyprefer: An Agentic Framework for Preference Data Synthesis

Apr 27, 2025

Synergistic Weak-Strong Collaboration by Aligning Preferences

Apr 22, 2025

AMPO: Active Multi-Preference Optimization

Feb 25, 2025

Verifiable Format Control for Large Language Model Generations

Feb 06, 2025

REFA: Reference Free Alignment for multi-preference optimization

Dec 20, 2024